Fault-Tolerant Distributed Simulation: A Position Paper
نویسنده
چکیده
Distributed Simulation is characterized by the fact that a simulation system is executed on multiple computing nodes that cooperate by exchanging messages. Regardless of the reasons for using distributed simulation in the first place (e.g. performance reasons), the execution of a distributed simulation depends on the proper functioning of all of the processing nodes and the underlying network. Depending on the level of reliability neccessary for a simulation system, the integration of fault-tolerance mechanisms is crucial. It turns out that there has not been much work on fault-tolerance in distributed simulation. The intention of this paper is to summarize the existing work and to point out possible research topics in this area.
منابع مشابه
A generalized ABFT technique using a fault tolerant neural network
In this paper we first show that standard BP algorithm cannot yeild to a uniform information distribution over the neural network architecture. A measure of sensitivity is defined to evaluate fault tolerance of neural network and then we show that the sensitivity of a link is closely related to the amount of information passes through it. Based on this assumption, we prove that the distribu...
متن کاملFault Tolerant Framework in MPI-based Distributed DEVS Simulation
Distributed DEVS simulation plays an important role in solving complex problems for its reuseability, and composability of component models. Using MPI to be the communication middleware, the distribution increases the performance. But even the tiny faults of computing resources can lead to crash. Hence Fault Tolerant is necessary to maintain the simulation reliability. This paper introduces a D...
متن کاملDistributed Adaptive Fault-Tolerant Consensus Control of Multi-Agent Systems with Actuator Faults
This paper presents an adaptive fault-tolerant control (FTC) scheme for leader-follower consensus control of uncertain mobile agents with actuator faults. A local FTC component is designed for each agent in the distributed system by using local measurements and certain information exchanged between neighboring agents. Each local FTC component consists of a fault detection module and a reconfigu...
متن کاملReal-time Fault-tolerant Scheduling in Heterogeneous Distributed Systems
∗ This work was supported by National Defense Pre-research Foundation of China. Abstract: Some works have been done in addressing real-time fault-tolerant scheduling algorithms. However, they all based on homogeneous distributed systems or multiprocessor systems, which have identical processors. This paper presents two fault-tolerant scheduling algorithms, RTFTNO and RTFTRC, for periodic real-t...
متن کاملEfficient Scheduling Algorithm with Fault-tolerance for Real-time Tasks in Distributed Systems
As real-time fault-tolerant scheduling is one of the main research areas in real-time fault-tolerant techniques, this paper proposes an efficient scheduling algorithm for BKCL(EBKCL). EBKCL can schedule the tasks with the fault-tolerant requirements(FTR) together with tasks without FTR. It is assumed in BKCL that there are no overlaps between the backup copies, however, the backup copies are al...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003